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Abstract 

This paper investigates the decentralized detection of Hidden Markov Processes using the Neyman- 
Pearson test. We consider a network fonned by a large number of distributed sensors. Sensors' observations 
are noisy snapshots of a Markov process to be detected. Each (real) observation is quantized on log2(Ai') 
bits before being transmitted to a fusion center which makes the final decision. For any false alarm level, it 
is shown that the miss probability of the Neyman-Pearson test converges to zero exponentially as the number 
of sensors tends to infinity. The error exponent is provided using recent results on Hidden Markov Models. 
In order to obtain informative expressions of the error exponent as a function of the quantization rule, 
we further investigate the case where the number A'^ of quantization levels tends to infinity, following the 
approach developed in fTI. In this regime, we provide the quantization rule maximizing the error exponent. 
Illustration of our results is provided in the case of the detection of a Gauss-Markov signal in noise. In 
terms of error exponent, the proposed quantization rule significantly outperforms the one proposed by (T] 
for i.i.d. observations. 

I. Introduction 

The design of powerful tests allowing to detect the presence of a stochastic signal using large Wireless 
Sensor Networks (WSN) is a crucial issue in a wide range of applications. In many practical contexts, 
each sensor of the network gathers information on its environment and forwards it to a distant Fusion 
Center (EC) which makes the final decision. Binary hypothesis tests are special cases where the EC has 
to decide between two possible hypotheses Hq and Hi. Consider a network composed of rt + 1 sensors, 
denote by Yk the random variable representing the observation of the kth sensor (k — . . .n) and by 
Po{yo:n) (resp. Pi{yo:n)) the probability distribution of the global observation vector Yg.n under Hq (resp. 
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Hi). The task of the FC is then to decide between the following hypotheses: 

Ho : Yo.,n ^ Po{yo:n) 

Hi : Yo:n ^ Pl{yO:n) ■ 

For any test function, we refer to the probability of false alarm (resp. the miss probability) as the probability 
that the FC decides hypothesis Hi (resp. Ho) under hypothesis Hq (resp. Hi). The Neyman-Pearson test 
consists in rejecting the null hypothesis whenever the (normalized) log-likelihood ratio (LLR) L„ defined 
by 

J- 1 , PoiYo:n) 

in=-log— — (1) 
n Pl(Yo:n) 

lies below a certain threshold. This threshold is set in such a way that the probability of false alarm is 
no larger than a certain level a G (0, 1) ||2J. The associated miss probability /3„(a) is the key metric to 
characterize the performance of the detection procedure. Unfortunately, no simple closed form expression 
is available in general. However, under some assumptions on the law of the observations, the following 
lemma due to Chen ||3] shows that the miss probability behaves as (3„ {a) ~ exp(— rt/f ) for large n, where 
K is a certain positive constant, independent of a, which we will refer to as the error exponent. 

Lemma 1 (f3l). Assume that converges in probability to a certain constant K > under Hq. Then, 
for any a € (0, 1), 

lim log/3„(a) = K . 

n— ^oo Tl 

The error exponent K provides crucial information on the performance of the NP test when the number 
n of sensors is large. By Lemma [T] the evaluation of K simply reduces to the asymptotic analysis of the 
LLR L„ under Hg. In this framework, a number of works in the Uterature derived and analyzed the error 
exponent for various observation models (see for instance Q, ||5) and reference therein). However, most 
of these works assume that the FC has a perfect knowledge of the sensors' observations. Unfortunately, 
in a WSN, the amount of information forwarded by each sensor node to the FC is usually limited, due to 
imperfect links between nodes of the network. Therefore, a large numbers of papers have been devoted 
to the construction and the analysis of decentralized detection schemes |[T], ||6|-||9). In this framework, 
each sensor has the ability to compress/quantize its observation before transmission to the FC, with the 
aim of decreasing the information transport burden to be supported by the network. For instance, fT), 
Q investigate the detection of deterministic signal in noise, while |[l], ||6j, ||8) study the case where all 
observations are independent and identically distributed (i.i.d.). 

Few is known when the observations are correlated. In | [T0| , the authors explore the effect of node 
density on the error exponent for the detection of correlated signals in Gaussian noise, assuming that the 
sensors communicate with the FC through Gaussian additive channels. In this paper, we investigate the 
more general case where the observations follow a Hidden Markov Model (HMM). More precisely, the 
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source signal to be detected is supposed to be a Markov chain. Sensors observations are distorted (noisy) 
versions of the latter source signal. Each observation is quantized on log2{N) bits before being transmitted 
to the FC. Our aim is to evaluate the impact of quantization on the error exponent, and to characterize 
relevant quantizers allowing to maximize the error exponent. 

The rest of the paper is organized as follows. In Section |n] we describe the observation model and 
evaluate the error exponent in the ideal case where the FC has perfect access to the observations. In 



Section III we introduce high-rate quantizers, and evaluate the degradation on the error exponent when the 



decision is made using quantized observations instead of the ideal ones. We determine relevant quantization 



strategies allowing to reduce this degradation. Section IV illustrates our results for the detection of a Gauss- 
Markov signal in noise. 

II. Detection of Hidden Markov Models 

A. Model of Observation 

In this section, we describe the probabilistic model underlying the observations. We assume that the 
observed time series Yk follows a Hidden Markov Model under both hypotheses Hq and Hi, with different 
transition kernels. Our hypothesis testing problem shall thus reduce to the question: Which kernel underUes 
the observation process ? 

To be more specific, let (X, X) and (F, y) be two measurable spaces. Consider two transition kernels 
Qo and Qi defined on X x X. We make the assumption that both kernels are positive and denote by I'i 
the invariant measure of Qi |11) . Let G be a transition kernel on X x y. Now consider two probability 
measures Pq and Pi on a relevant measurable space (fi,?) and a stochastic process {Xk,Yk)kez such 
that for each i £ {0, 1}, the following holds true under probability measure Pf. 

. (Xfe,Yfc)fcgz is stationary. 

• {Xk,Yk)k>o is a Markov chain with kernel Ti defined for each {x,y) G X xY and each C G X (g) ^ 
by 

mx,y),C] = J j^Q,{x,dx')G{x',dv') . (2) 

• has distribution Vi. 

In particular, definition (j2| implies that the state Xk is a Markov chain with transition kernel Qi under Pi. 
The observation Yk is such that 

P,[Yk e B\Xk] = G{Xk, B) , VB e y . (3) 

The aim is to decide between probability measures Pq and Pi based on the observation of Fo:n, the state 
sequence X^.^ being unobserved. Note that Equation ([3]) implies that the distribution of the observation 
Yk conditionnally to the state Xk is identical under both Pq and Pi. Roughly speaking, one can think 
of Yk as a noisy version of a process Xk to be detected, where the distortion (typically a measurement 
noise) does not depend on the hypothesis. We make the following assumptions. 
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Al. There exist two probability measures A and fi on (X,X) and {Y,'^) respectively s.t. for each 
X £ X and each i € {0, 1}, Qi{x, .) admits a density x' ^ qi{x, x') w.r.t. A and G{x, .) admits a density 
y '-^ 9{x,y) w.r.t. p.. 

A2. There exist two real numbers a^, s.t., for each i e {0, 1} and each x,x' £ X, < < 
qi{x, x') < a+. 

A3. < / g{x, y)X{dx) < oo for each y GY. 

A4. snp^ yg{x,y) < oo . 

A5. Eq |log J g{x,Yo)X{dx)\ < oo. 

We mention that A2 implies some restrictions on the state model. For instance, this assumption generally 
excludes state models with unbounded support such as the Gaussian autoregressive model considered in Q. 
Note that milder assumptions, under which most of the theoretical results used in this paper still holds 
true, can be found in p2) . Such an extension is however out of the scope of this paper. 

For each i £ {0, 1} and each integers k < £, denote by Pi{yk-i) the density of measure Pi\Yk-( € .] 
w.r.t. to measure ^'^(^-''"+1). Denote by Pi{yi\Yk-i-i) the density of Pi\Yi <E . \Yk-i-i\ w.rt. pi. 

B. Error Exponent for Unquantized Observations 

In this paragraph, we derive the error exponent associated with the NP test in the ideal case where the 
FC is supposed to have perfect access to the sensors' observations Y^^-n. From Lemma [T] the derivation 
of the error exponent K reduces to the study of the LLR L„ under Pq. The following result can be easily 
proved from p2| . 

Theorem 1 (| |12) ). Under Assumptions Al— 5, the following holds true. 

i) For each i £ {0, 1} and each k > 0, the following limit exists with probability one under Pg: 



Elements of the Proof. 

Here we only recall the general ideas and refer to |12 | for the detailed motivation of the forthcoming 
statements. The first step of the proof consists in rewriting the LLR Ln using the so-called chain rule. 



'Ci(F_oo:fe) = lim \0gPt{Yk\Y_m;k-l) ■ 



(4) 




Equation ([T]i becomes 




(5) 
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Thus, the asymptotic analysis of L„ as n — > cx) reduces to the separate study of each of the two terms 
of the rhs of the above equation. Focus for instance on the second term. The main step of the proof is to 
estabUsh the following forgetting property of the past observations, for each fc > and m' > m > 1: 

I logp,{Yk\Y_^..k-i) - logPi{Yk\Y_^>..k-i)\ < ^ /+"-^ , (6) 

where we defined p = / < \. The core of the proof of the above inequahty is essentially related 
to the mixing condition A2 which implies a Dobrushin's contraction condition on the forward smoothing 
kernels. From inequality (j6jl, {\ogpi{Yk\y-m:k-i)) m> -k therefore a Cauchy sequence. Denote its limit 
by £i(F_oo:fc)- Plugging Q into (|6| and using the triangular inequality, it is straightforward to show that 

^0 . 

n— ^oo 

Assumptions A4 and A5 ensure that |£ 1(1^-00:0) I < co- As process Yk is stationary under Pq^ the 
ergodic theorem along with the latter equation leads to 

1 " 

- Vlogpi(rfe|r0:fe-l) = Eo [£l(y_oo:0)] • 

fc=0 

By the same token, the first term of the rhs of Equation ([5]l converges towards Kq — Eq [£o(1^-oo:o)]- 
Therefore, L„ converges to Kq — Ki. This proves Theorem [T| 

III. Main results 

From now on, we assume that each sensor's observation belongs to an interval of the form Y = [a, b] 
for some a < b. We furthermore assume that ^ is the Lebesgue measure on Y normalized in such a way 
that ^(y) = 1. 

A. Definitions 

We now investigate the case where the final decision is made from quantized observations. For a given 
integer > 1, we define an iV-point quantizer as a triplet Q^r — (Sjv, S^v, Cw) where Sat is a set of 
intervals 5^,0, • ■ • , Sn,n-i which form a partition of Y, where Sat = {<^Ar,o, ■ • • , £,n.n-i} is an arbitrary 
set of distinct elements and where : y — is a function s.t. ^Ar(y) = £,N.j whenever y G Sn.j- We 
will refer to each interval Sjyj as a cell of the quantizer If y^ represents the value of the kth sensor's 
observation, we denote by ZN^k = £.N{Yk) the quantized observation on log2(iV) bits. From now on, we 
assume that only the quantized observations Z^^-n = {Zn,q ■ ■ ■ Z^.n) are available at the FC. Following 
the terminology of |[T|, we introduce some useful characteristics of a given quantizer Qat: 

• The length of cell j is defined by tjsi j — Jg^ dy. 

• The specific point density in cell j is defined by Cw.j = wi^- For convenience, we also define 
function (n on Y by Civ(y) ~ (nj whenever y e Snj- 



n 1 ^ 

- Viogpi(yfe|yo:fc_i)-- V£i(y- 

n ^ — ^ n ^ — ^ 



fe=0 
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B. Neyman-Pearson Test on Quantized Observations 

We derive the NP procedure for testing Pq vs Pi. Consider the following weighted counting measure 
of the ^jv j's: 

N-l 
3=0 

Clearly, the joint distribution Pi[ZN.o:n G . ] of the quantized observation vector is absolutely continuous 
w.r.t. fi^^"^^\ We denote its density by Pi,Ar(2o:n)- The NP test consists in rejecting the null hypothesis 
for small values of the LLR L„.n associated with the quantized observations: 

T 1 1 PO,N {Zn.O-.ti) 

Ln,N = - log 1— r . 

n Pl,N {ZN.,Q:n) 

We now study the error exponent of the latter test. Note that the event [Z^ k = ^nj] is equivalent to the 
event that Yk falls into cell SN,j. Using (jsjl, we thus obtain for each i G {0, 1} 

Pi[ZN,k — £.N.j I Xk] = G{Xk, Snj) ■ 

As a consequence, process {Xk, Zi^_k)k still forms a HMM, where the transition kernel Gat which links the 
state to the quantized observation Zjv,a: is given by Gn{x, ^n.j) ~ G{x, Sjyj) for each j E {1, . . . , N}. 
We remark that Gjv admits the following density w.r.t. fij^: 

9N{x,^N,j) ^ -f— I g{x,y)dy . (7) 

If Assumptions Al— 5 hold, then it is straightforward to show that the new (quantized) HMM {Xk, ZN,k)k 
still verifies conditions of same kind. This ensures that Theorem [T| still applies to the distorted LLR Ln,N- 
Thus, we obtain the following corollary. Define z i— ?> Pi,N{z\ZM-m:-i) as the derivative of P^IZj^ q e 

•|^JV-m:-l] W.r.t. ^JV- 

Corollary 1. Under Pq, Ln^N converges a.s. as n ^ oo towards the constant Kn defined by 

Kn = Eq [^^0^^(^^,-00:0)] — Eq [■Ci^n{Zn-oc,:o)] j (8) 

where for each i e {0, 1}, 'Ci,Ar(2^Ar.-oo:o) is the almost sure limit under Pq o/logpi,Ar(ZAr.o|^Ar.-m:-i) 
as m — )■ 00. 

The above corollary provides the error exponent Ki\} associated with the NP test on quantized observa- 
tions. A natural question is: How does the choice of the quantizer Qjv affect the detection performance ? 
Unfortunately, the error exponent depends on the cells Snj in a rather involved way which does not 
immediately allow to evaluate the impact of the quantizer. In the sequel, we follow the approach of [11, 
[ [T3) and focus on the case where the order N of the quantizer tends to infinity. We refer to such quantizers 
as high-rate quantizers. This approach leads to a convenient and informative asymptotic expression of Kn, 
which can be easily maximized as a function of the quantizer. 
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C. High-Rate Quantizers 

Consider a family of quantizers (QAr)Ar>i. We make the following assumption. 

A6. As N — >■ oo, Qn converges uniformly to a certain continuous function C, s.t. infygy C,{y) > 0. 

We will refer to C as the model point density of the family (Q7v)jv>i- For each y E Y, C,{y) can be 
interpreted as the asymptotic density of cells in the neighborhood of y. Intuitively, high-rate quantizers 
should be constructed in such a way that ({y) is large at those points y for which a fine quantization is 
essential to discriminate the two hypotheses. Theorem |2] below provides a more rigorous formulation of 
this intuition. We need further assumptions. 

A7. For each x, function y g{x,y) is of class C3 on Y. 



A8. iidx,y g{x,y) > and sup. 



< 00. 



We now provide the main result. Recall that po(y) is the pdf of Yq under Pq. 

Theorem 2. Under Assumptions Al— 8, the following statements hold true, 
i) The following limit exists with probability one under Pq: 



def 



Moreover, \1{Y^ 



< C for some constant C. 



lim lim 



9 log 20 



Pi 



dyo 



m:k ) 



ii) As N tends to infinity, N^{K — K^) converges to a constant Dq given by 



1 

24 



po{y)F{y) 

ay? 



(9) 



where F{y) = Eq [£(r_oo:c 
Hi) Moreover, 

where equality holds when 



Yo = y . 

{h-af 



24 



C(y)- 



P,{y)F{yt" ^i{dy) 
[p,[y)F{yf 



(10) 



j[p,{s)F{s)f'ds ■ 

Theorem |2] states that when the order of the quantizer tends to infinity, the error exponent associated 
with the NP test converges at speed to the error exponent K that one would have obtained in the 
absence of quantization. Roughly speaking, if /3„,Ar(a) represents the miss probability of the NP test of 
level a, the approximation 



-n{K- 



l3n,N{0L) 

is valid when both the number n of sensors and the order N of quantization are large, but n ^ N. The 



loss in error exponent depends on the quantizer family only via its model point density (. Expression ( lOl 
provides the optimal choice of C i.e., the model point density which minimizes the loss in error exponent. 
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Note that this expression is quite similar to Bennett's one | i4| which gives the MSE-optimal model point 
density (in this case F{y) = 2 on Y). 

In the particular case where the observations are i.i.d., our expression of the loss is consistent with 
the one obtained by Gupta and Hero (see Equation (20) in ||T|). 

Insights on the Proof. 

Due to the lack of space, we only provide some of the basic ideas underlying the proof of Theorem [2] 
A rigorous proof will be provided in an extended version of this paper. 

Since the error exponent Kn does not depend on the particular choice of the quantization alphabet 
Sjv, we assume that each ^jvj coincides with the center of cell Sn.j- We separately study each term 
Ki^N — Eq [£i,Ar(^Ar,-oo:o)] that appears in Equation (|8]l. First focus on Ki^n and define m = m.{N) a 
sequence of integers such that ^^^^ — > oo and — > as — > oo. We write 

Ki^N — Ki = Eq [£i,7v(^7V,-co:o) " ^liX-oo-.a)] 

where — Eq [logpi_jv(^Af,ol^Af,-jri:-i) — logPi(^o|^-m:-i)] and 6^ is the remainder term. The rhs 
of Equation (|6]) converges to zero exponentially as m tends to infinity and the sequence m{N) grows at 
a superlogarithmic rate. Consequently, the remainder (5 at is a little-o of The study of T/v is based on 
the following expansions: 

logPl,A'(^7V,o|^W -m:-l) = 

logPl,Ar(ZAr _,„:o) - logpi,jv(-^Ar.-m:-l) , (H) 
Pl,N{Z-m:u) = 

/-. u u 

. . .1 vi{dx^rn) W Qi{xi_i,dxi) J]^ gN{xi,Zi). (12) 

2 — — rn+l i—~rn 

Plugging Taylor-Lagrange expansion of y i— > g{x^ y) at point ^ in ^ leads to the following approximate: 

9n{x,^nj) ^ 9{x,^N,j) + ^q;^{x,£.n,j) ■ (13) 
Note that the second term of the rhs of the above equation vanishes as iV — > oo, due to the fact that the 



cell lengths ijyj skrink. We now plug the above expansion into (12i. The dominant term is nothing else 



than pi{z^„i:u) (only keep term g{x,£,N.j) in (13i). Therefore, (111 is approximately equal to 

logpi{ZN,o\ZN,-7n:-l) + ^Af(^Af,m:o) 

where Ai\[{ZN^jn:o) contains the dominated terms. Therefore, the quantity of interest Tn can be written as 
Tat ~ An+'Bn where yiAT = Eq [AN{ZN^,n:o)] and Sat = Eq [\ogpi{ZNfi\ZN,^,n:~i) - logpi(ro|F_„:_i 



After some algebra, and using recent results on HMM 1 12|, we prove that N'^Ajy and N'^'B^ respectively 
converge to some constants ca and as — oo. Thus, N'^{Ki^m ~ Ki) converges to the sum ca + cb- 
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Fig. 1. Probability and model point densities (cr = 1) 



Evaluating these constants, and proceeding in the same way for the study of N'^{Ko^n ~ Kq), we obtain 
the first and second points of Theorem [2] From Holder's inequality, it is straightforward to prove the third 
point. 

IV. Numerical Results and Discussion 

We consider the situation of detecting a Gaussian first-order autoregressive process (AR-1) embedded 
in noise [4|: 

Ho: Yk = Wk 

(14) 

Hi : Yk ^Xk + Wk , 
where Xk is a stationary Gaussian AR-1 process: 

Xk = aXk-i + ^l-a^Uk, (15) 

where a e (0, 1) is the correlation coefficient, Uk ^'^'^ ^(0, 1) is the innovation process and Wk ^'^"^ 
y^{0, (T^) is the observation noise. We mention that in this case, all densities have infinite support so that, 
strictly speaking, the assumptions made in this paper are not satisfied. Nevertheless, the above model can 
be slightly modified to be consistent with our assumptions. For instance, in order that the transition kernel 



of (15 I strictly fits Assumptions A1-A2, it is sufficient to replace the distribution 7^(0, 1) of Uk with the 
corresponding truncated distribution on an arbitrarily large support. In order to simplify the presentation, we 
do not go into more details and keep model ([T4]i-([T5]l with slight abuse. We compare different quantization 
strategies in terms of the error exponent loss D(^: 

• optimal quantization: the model point density is given by Equation ( fTO] ), 

• i.i.d. -optimal quantization: the model point density is drawn as if observations were i.i.d. (see [1]) 
i.e., we only use the marginal pdf of the observation to design the quantization rule, 

• uniform quantization: all cells have the same size (in the interval [— lOcr, lOfi]). 

Figure [T| represents the model point densities for different values of parameter a. As the value of a has 
no impact on the marginal distribution of the observations, the density associated to the i.i.d. quantizer is 
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1 

o.s; 



( 

Fig. 2. Error exponent loss Z)^ for different quantization strategies (a = 1) 

clearly independent of a. The latter density Ciid{x) is equal to zero at point a; = 0, meaning that, if the 
observations were indeed i.i.d., fine quantization of the observations falling in the neighborhood of zero 
would be useless. On the opposite, the proposed quantization rule suggests to use a significant density 
of cells in the neighborhood of zero, especially when parameter a is large. Figure |2] represents the error 
exponent loss as a function of a for u = 1. As a matter of fact, when the model point density C = 
is plugged into Equation (j9]l, the integral diverges, due to the fact that Cudix) cancels at a; = 0. Intuitively, 
this indicates that if the i.i.d. quantization rule is used to quantize the non-i.i.d. observations ([l4|-([T5]l, the 
quantized error exponent does no longer converge to the perfect error exponent at speed N"^. Here, the 
proposed point density clearly outperforms the i.i.d. one in terms of asymptotic error exponent loss. 
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